A Retrieval Language for Historical Documents
نویسندگان
چکیده
This paper focuses on a set of structured document applications that we have denoted databases of historical documents. The information into these documents is closely related to the time in which they are created while being still of great usefulness in the future. The main contribution of this paper is the formulation of a group of operators and predicates that express retrieval conditions over the temporal features of documents. Additionally, the resulting retrieval language supports the construction of time series of documents, here named chronicles, which are regarded as a preliminary step towards the implementation of more complex data mining operations over historical document repositories.
منابع مشابه
An Approach to Cross-Age and Cross-Cultural Information Access for Digital Humanities
1. Introduction Since libraries have collection of documents across age and culture, and even language, the libraries are inherently multi-age, multi-cultural, and multilingual. In the digital age, more and more historical documents are being digitized to preserve contents written in deteriorating papers. Library, etc.). It means that more and more old text contents will be accessible on the in...
متن کاملNatural Language Dialogue System for Information Retrieval
The objective of our work is the development of a natural language dialogue system for information retrieval with multimodal input and multimedia output. Overall, the system consists of three phases: input analysis, information and knowledge management and output generation. The dialogue system is designed for consulting old Mexican historical documents. In this paper we describe the designed a...
متن کاملExploiting Semantic Web Technologies for Intelligent Access to Historical Documents
The FDR/Pearl Harbor Project involves the enhancement of materials drawn from the Franklin D. Roosevelt Library and Digital Archives, which includes a range of image, sound, video and textual data. The project is undertaking the encoding, annotation, and multi-modal linkage of a portion of the collection, and enhancement of a web-based interface that enables exploitation of state-of-theart meth...
متن کاملXOR - XML Oriented Retrieval Language
The wide acceptance and rapidly growing use of XML as a standard storage and retrieval data format blurs the historical divide that exists between Information Retrieval and Database Retrieval. On the structured database retrieval side it is now possible to support highly structured access to documents using XML specific tools such as XPath, XQuery, XQL and more. On the information retrieval sid...
متن کاملInformation Access to Historical Documents from the Early New High German Period
With the new interest in historical documents insight grew that electronic access to these texts causes many specific problems. In the first part of the paper we survey the present role of digital historical documents. After collecting central facts and observations on historical language change we comment on the difficulties that result for retrieval and data mining on historical texts. In the...
متن کامل